Semiparametric efficient estimation for the auxiliary outcome problem with the conditional mean model

نویسندگان

  • Jinbo CHEN
  • Norman E. BRESLOW
چکیده

The authors consider semiparametric efficient estimation of parameters in the conditional mean model for a simple incomplete data structure in which the outcome of interest is observed only for a random subset of subjects but covariates and surrogate (auxiliary) outcomes are observed for all. They use optimal estimating function theory to derive the semiparametric efficient score in closed form. They show that when covariates and auxiliary outcomes are discrete, a Horvitz–Thompson type estimator with empirically estimated weights is semiparametric efficient. The authors give simulation studies validating the finite-sample behaviour of the semiparametric efficient estimator and its asymptotic variance; they demonstrate the efficiency of the estimator in realistic settings. Estimation semiparamétriquement efficace pour le problème du résultat auxiliaire dans le modèle à moyenne conditionnelle Résumé : Les auteurs s’intéressent à l’estimation semiparamétriquement efficace de paramètres dans le modèle à moyenne conditionnelle pour une structure de données incomplète simple dans laquelle l’événement d’intérêt n’est observé que pour un sous-ensemble aléatoire de sujets alors que les covariables et les variables de substitution (auxiliaires) sont observées pour tous. Ils font appel à la théorie des fonctions d’estimation optimales pour déterminer le score semiparamétriquement efficace de façon explicite. Ils montrent que lorsque les covariables et les variables auxiliaires sont discrètes, un estimateur de type Horvitz–Thompson à poids estimés empiriquement est semiparamétriquement efficace. Les auteurs présentent des études de simulation validant le comportement à taille finie de l’estimateur semiparamétriquement efficace et de sa variance asymptotique; ils démontrent en outre l’efficacité de cet estimateur dans des contextes réalistes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust high-dimensional semiparametric regression using optimized differencing method applied to the vitamin B2 production data

Background and purpose: By evolving science, knowledge, and technology, we deal with high-dimensional data in which the number of predictors may considerably exceed the sample size. The main problems with high-dimensional data are the estimation of the coefficients and interpretation. For high-dimension problems, classical methods are not reliable because of a large number of predictor variable...

متن کامل

Semiparametric Efficiency in Gmm Models with Auxiliary Data By

We study semiparametric efficiency bounds and efficient estimation of parameters defined through general moment restrictions with missing data. Identification relies on auxiliary data containing information about the distribution of the missing variables conditional on proxy variables that are observed in both the primary and the auxiliary database, when such distribution is common to the two d...

متن کامل

SEMIPARAMETRIC EFFICIENCY IN GMM MODELS WITH AUXILIARY DATA By Xiaohong Chen,1 Han Hong2 and Alessandro Tarozzi

We study semiparametric efficiency bounds and efficient estimation of parameters defined through general moment restrictions with missing data. Identification relies on auxiliary data containing information about the distribution of the missing variables conditional on proxy variables that are observed in both the primary and the auxiliary database, when such distribution is common to the two d...

متن کامل

Semiparametric Efficiency in GMM Models with Auxiliary Data

We study semiparametric efficiency bounds and efficient estimation of parameters defined through general moment restrictions with missing data. Identification relies on auxiliary data containing information about the distribution of the missing variables conditional on proxy variables that are observed in both the primary and the auxiliary database, when such distribution is common to the two d...

متن کامل

Targeted Maximum Likelihood Estimation of Natural Direct Effect

In many causal inference problems, one is interested in the direct causal effect of an exposure on an outcome of interest that is not mediated by certain intermediate variables. Robins and Greenland (1992) and Pearl (2000) formalized the definition of two types of direct effects (natural and controlled) under the counterfactual framework. Since then, identifiability conditions for these effects...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004